The Annals of Statistics 

2004, Vol. 32, No. 6, 2712-2730 

DOI: 10.1214/009053604000000689 

© Institute of Mathematical Statistics, 2004 
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We consider the problem of approximating the moment gener- 
ating function (MGF) of a truncated random variable in terms of 
the MGF of the underlying (i.e., untruncated) random variable. The 
purpose of approximating the MGF is to enable the application of 
saddlepoint approximations to certain distributions determined by 
truncated random variables. Two important statistical applications 
are the following: the approximation of certain multivariate cumu- 
lative distribution functions; and the approximation of passage time 
distributions in ion channel models which incorporate time interval 
omission. We derive two types of representation for the MGF of a 
truncated random variable. One of these representations is obtained 
by exponential tilting. The second type of representation, which has 
two versions, is referred to as an exponential convolution represen- 
tation. Each representation motivates a different approximation. It 
turns out that each of the three approximations is extremely accurate 
in those cases "to which it is suited." Moreover, there is a simple rule 
of thumb for deciding which approximation to use in a given case, 
and if this rule is followed, then our numerical and theoretical results 
indicate that the resulting approximation will be extremely accurate. 

1. Introduction. 

1.1. Saddlepoint methods. Saddlepoint methods provide approximations 
to densities and probabilities which are very accurate in a wide variety of 
settings. This accuracy is seen not only in numerical work, but also in theo- 
retical calculations. In particular, it is often the case that relative errors of 
these approximations stay bounded in the extreme tails, a desirable property 
which is not shared by most other types of approximation used in statistics. 
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For development and discussion of saddlepoint methodology and related 
methods, see Daniels (1954) for details of the density approximation; Barndorff- 
Nielsen and Cox (1989, 1994) for applications to inference; Lugannani and 
Rice (1980), Temme (1982) and Daniels (1987) for discussion of a tail area 
approximation which has uniform relative error, and Skovgaard (1987) for 
a conditional version of this approximation; and Reid (1988) for a review of 
saddlepoint techniques. 

Saddlepoint approximations are constructed by performing various op- 
erations on the moment generating function (MGF) or, equivalently, the 
cumulant generating function (CGF), of a random variable. Let X be an 
absolutely continuous random variable with density f{x) with respect to 
the Lebesgue measure, moment generating function M(t) and CGF Kit) = 
logM(i). Then the first-order saddlepoint density approximation to f(x) is 
given by 

/(*) = {2nK"(t )}~ l l 2 e W {K(i) - ix}, 

where t = t is the (unique) solution to the saddlepoint equation K'(t) = x, 
and primes denote derivatives. The Lugannani and Rice (1980) saddle- 
point approximation to the cumulative distribution function (CDF) F{y) = 
P(X < y) is obtained by taking 9 = 0, F = F and K = K in (19). 

More recent developments include saddlepoint approximations for non- 
linear statistics. See Daniels and Young (1991), DiCiccio and Martin (1991) 
and Jing and Robinson (1994) for further details, and see Jensen (1995) for 
a rigorous account of the underlying mathematical theory of saddlepoint 
methods. An extensive discussion of saddlepoint methods and their applica- 
tion will appear in Butler (2004). 

Unlike much of this previous work, the current paper uses saddlepoint 
methods to approximate MGFs of truncated distributions with the view 
that these approximate MGFs may be used for further saddlepoint inversion. 
The work is therefore similar in spirit to Fraser, Reid and Wong (1991) and 
Butler and Wood (2002a). 

1.2. Truncation. Suppose that X{ denotes a random variable with known 
MGF Mi{0) for i = 1, . .. ,n, and that for each i we observe Y% = Xi\Xi G 
(a,i,bi), that is, Yi is Xi conditioned to lie in the interval (ai,bi). In this 
paper we are concerned with the following question: is there a convenient 
and accurate way to approximate the CGF ofYi using only Ki(9), the CGF 
of the untruncated variable Xi? 

If we are just interested in a single random variable, Y% say, then the 
question is probably not of much interest because the density and CDF of 
Y\ can be expressed simply in terms of the density and CDF of X\, with the 
latter approximated using the saddlepoint approximations indicated above. 
However, there are situations in which approximations to the CGFs of the 
{Y{} are potentially very useful. We mention two such examples. 
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1.2.1. Computation of Dirichlet probabilities. We may wish to construct 
a saddlepoint approximation for the distribution of the sum X^Li^i- One 
such application is to the approximation of certain multivariate CDFs aris- 
ing in sampling theory and extreme value theory as discussed in Butler 
and Sutton (1998). For these applications, the multivariate CDF is ex- 
pressed in terms of the density of Ya=iYii where the underlying MGFs 
of the Xi are known. Consider, for example, the probability that an ar- 
bitrary Dirichlet vector D = (D±, . . . , D n ) ~ Dirichlet{7 = (71, . . . , 7„)} lies 
in a general rectangular region (a, b) = rnt=i( a i>M C (0, l) n . If the compo- 
nents of X = (X\, . . . ,X n ) are independent with Xi ~ Gamma(7j, 1), then 
the Dirichlet is represented in terms of independent Gammas as D = X/5, 
where S = J27=i-^i- By independence of S and X/5, the distribution of D 
is also the conditional distribution of X given that 5 = 1. These facts and 
Bayes' theorem lead to 



Here /<?{1|X G (a, b)} is the density of Z = Y2=i *i at 1, where Y i = X l \X i £ 
(a,i,bi), which we approximate using the saddlepoint density. The other terms 
are standard computations: PrjA^ G (aj,6j)} is a gamma probability and 
/s(l) is the Gamma(Xi ! Ji, 1) density of S at 1. 

1.2.2. Ion channel models with time-interval omission. First, we con- 
sider an ion channel model which is represented as a two-state homogeneous 
semi-Markov process with state space {o,c}, where state o (state c) corre- 
sponds to the ion channel being open (closed). Suppose that we observe the 
process T 0> o,T C) i,T 0; i,T Cj 2, ■ ■ ■ , where T Q j is the length of the jth sojourn 
in the open state, and T c ^ is the length of the kth sojourn in the closed 
state. We have assumed that the process has started in state o; one could 
equally start in state c. Homogeneity and the semi-Markov assumption im- 
ply that {T j : j > 1} and {T Cj fc : k > 1} are both independent and identically 
distributed (IID) sequences. Suppose that the MGFs of T 0j \ and T Cj i are, 
respectively, Q oc {9) and & co {6) and that both are convergent in open neigh- 
borhoods of zero. 

In ion channel modeling, a phenomenon known as time interval omis- 
sion is commonly built into the model. In effect, this means that only state 
residences which last for longer than a given time threshold are observed 
(or detected), and those residences lasting for less than this threshold are 
not observed (or are undetected); that is, we only observe those sojourns 
in state o (state c) which last at least r Q (r c ); otherwise, it appears to the 
observer that a jump has not occurred. Time interval omission occurs be- 
cause of limitations in the sensitivity of the measuring device. Denote the 
observed sequence by T 0j o, T C) i, T 0; ±, .... As a concrete example, suppose that 



(1) 



Pr{DG(a,b)}=/ 5 {l|XG(a,b)} 



nr = iPr{X,G(a,A 
fs(l) 



)} 
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T c ,i > t c , r 0) i < To and T 0i2 > t c ; then T ofi = T Oj0 , T Cjl = T Cj i + T 0j i + T C)2 . 
The sequences {T Q j :j> 1} and {T Cj fc : A; > 1} are both IID. Let & c(8) and 
Q co {6) denote the MGF of a typical member of each sequence. For inferential 
purposes it is important to express 3> oc and $ co in terms of <3? oc and $ co . 
Define 

(2) $£(0) = £[exp(0r o!l )I(T O)1 > 7b)], 

(3) $£(0) = E[e W {0T o ,i)I(T o>1 < t )] 
and 

vr o = P[T Oil >r o ]=$f c (0), 

with corresponding definitions for 3>^, <E>^ and 7r c . Elementary arguments 
show that ^ocl^) can be expressed in terms of a geometric series: 

oo 
n=0 

A similar argument shows that 

$ co (0) =7T- 1 $£{1 - C(^)*co(0)}-V o . 

The above discussion shows that time interval omission leads directly to 
consideration of MGFs of truncated random variables. 

More interesting ion channel models have several open states and sev- 
eral closed states, some of which communicate; see Ball, Milne and Yeo 
(1991). This leads to a more complicated structure, due to aggregation, in 
which $oc(^) and & co (6) now represent matrices, each component of which 
is essentially an MGF which can be expressed as a rational function of the 
MGFs of the underlying distributions of transition times between individual 
states. These rational functions are difficult to write down explicitly, but 
they are straightforward to compute numerically using matrix algebra; see 
Ball, Milne and Yeo [(1991), Section 3] and also Butler (2000) for analogous 
calculations in a reliability context. The key point is that, in multistate ion 
channel models, 3> oc and $ co are matrices rather than real numbers, but 
have similar form to that given above, and are expressed in terms of the 
MGFs of truncated random variables, as in (2) and (3); see Ball, Milne and 
Yeo [(1991), Section 4]. Accurate approximation of these distributions can 
be performed using the methods developed in this paper, but seems to be 
very difficult otherwise (except in the Markov case). 

The present paper was motivated by the ion channel application described 
above. Further details of this application will be presented in Ball, Butler 
and Wood (2004). 
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1.3. Outline of the paper. In Section 2 we consider two types of repre- 
sentation for the MGF of a truncated random variable expressed in terms of 
the MGF of the underlying random variable. One of these representations is 
obtained by exponential tilting. A second type of representation, which has 
two versions, is referred to as an exponential convolution representation. In 
Section 3 we consider saddlepoint approximations to the MGF of the trun- 
cated random variable which are motivated by these representations, and 
indicate their performance in a number of examples. In Section 4, results 
concerning the tail behavior of the various approximations are given. Proofs 
of the theorems are given in the Appendix. The research report Butler and 
Wood (2002b) presents extensions to the lattice and multivariate cases, as 
well as additional numerical examples. 

It turns out that each of the three approximations is extremely accurate 
in those cases "to which it is suited." Moreover, there is a simple rule of 
thumb (see Section 3.4) for deciding which approximation to use in a given 
case. If this rule is followed, numerical and theoretical results indicate that 
the resulting hybrid approximation will be extremely accurate. 

2. Representations of truncated MGFs. 

2.1. Preliminaries. Let M (6) denote the MGF and K (9) = logM (6>) 
the CGF of a random variable X on R with density /o with respect to the 
Lebesgue measure, and CDF Fq{x) = P(X < x). Assume that Mq(9) has a 
convergence strip given by 9 £ {—a, P), where < a, (3 < oo. Let a < b denote 
real numbers such that Fo(b) — Fq(o) > 0. 

Let 

(4) M {a , b) (0) = fe 9x dF (x) 

F (b) - F (a) J a 

denote the MGF of X truncated at a and b, and conditioned to lie in (a, b). 
We shall refer to M^f,) W as a truncated MGF which is an abbreviation for 
the MGF of a truncated random variable, and similar terminology is used 
for other quantities such as the CGF. 

In this paper we discuss how to approximate the truncated CGF tC( a ^ (9) = 
log M.( a ,b) W an d its derivatives in terms of the original CGF Kq{6) = log Mq{9) 
and its derivatives. 

2.2. Tilted representation. Let Fg{x) denote the CDF of the 0-tilted dis- 
tribution of X, that is, dFg(x) = fe{x) dx = e 8x dF Q (x)/M Q (9), where fe(x) = 
e 9x fo{ x ) I 'Mo (9) is the density corresponding to the CDF Fg. Then, for 
9 6 (— a, P), elementary manipulations show that 

(5) M (a , b) (9) = M (9)[{F e (b) - F e (a)}/{F (b) - F (a)}]. 
We shall refer to (5) as the tilted representation of Mf a ,b)(^)- 
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2.3. Exponential convolution representations. We now provide alterna- 
tive representations of (4) which are collectively valid for all 9 in the con- 
vergence interval of M-u^y Define 

(6) E l (9,y) = — M (s)- ds, -a < a < minQ3, 9) 

Am Jci -too 9 - s 



and 

I rc 2 +ico e (8~s)y 

(7) E 2 {9,y) = — M (s) -ds, max(-a, 9) < c 2 < (3. 

ZTTl Jc2—ioo S u 

Theorem 2.1 (Properties of Hi and E 2 ). Suppose that Fq is absolutely 
continuous with density fo, and assume that for some cG (— a, (3), there 
exists a v{c) € (0, 00) such that 

(8) / \M (c + it)\ 1+u ^dt< 00. 

JteR 

Then the following results hold: 

(i) We have 

(9) Si(0,y)= f e 6x f (x)dx, 9e{-a,oo) 

J — 00 

and 

(10) E 2 (9,y) = e dx f (x)dx, 9e(-oc,(3). 

Jy 

Hence, 

(11) ~ 1 (9,y) + Z 2 (9,y) = M (9), 9e(-a,/3). 

(ii) Let X denote a random variable with MGF Mq(9) and let E denote 
an exponential random variable with rate parameter \9\ which is independent 
ofX. When9>0, 

(12) Z 1 (9,y)=9- 1 e d yf x+E (y); 
and when 9 < 0, 

(13) E 1 (9,y) = M (9) - \9\~ l e e y f x . E {y). 

In the statement of this theorem fz denotes the density of a random variable 
Z. 

(iii) When 9 > 0, 

(14) 3 2 (0, y) = M (9) - 9- l e e yf x+E (y) ] 
and when 9 < 0, 

(15) Z 2 (9,y) = \9r 1 e e yf x „ E (y). 



SADDLEPOINT METHODS FOR TRUNCATION 



7 



(iv) In the respective domains of definition for Hi and E, 2 , 

M { _ OOiy) (6) = Z 1 (8,y)/F (y) and 

(16) 

M {y , oo) (9)=E 2 (9,y)/{l-F (y)}. 

(v) For a general interval (a,b), Mu ^iO) has the alternative represen- 
tations 

(17) M (a>b) {9) = {E 1 (9,b)-E 1 (9,a)}/{F (b)-F (a)}, Be (-a, 00), 
and 

(18) M {a , b) (9) = {E 2 (9,a)-Z 2 (9,b)}/{F (b)-F (a)}, 0€(-oo,/3). 

We refer to (16)-(18) as exponential convolution representations of the 
corresponding truncated MGFs. 

Remark 2.1. Condition (8) is a mild smoothness requirement on the 
underlying density /q. Note that if, for some c, (8) holds with v(c) £ (0, 1], 
then absolute continuity of Fq follows; see Theorem 11.6.1 in Kawata (1972). 
However, if we must take v(c) > 1 for all c, then Fq need not be absolutely 
continuous; see Theorem 13.4.2 in Kawata (1972) for a counterexample. 

Remark 2.2. Although (11) follows immediately from the addition of 
(9) and (10), it is also interesting to note that (11) is a consequence of 
Cauchy's theorem; see Butler and Wood [(2002b), Section 2]. 

3. Approximations. We now present approximations to the truncated 
CGF /C( o m(0) = logM.^ ab ^(9) an d its derivatives, distinguishing between 
the one-sided cases a = —00 and b = y < 00, and a = y > —00 and b = 00, 
and the two-sided case a > — 00 and b < 00. 

3.1. Lugannani and Rice approximation. Using the tilted representation 
of the truncated MGF, we obtain 

IC i _ OO)y) (8)=K (9) + log{F e (y)/F (y)}. 

We may approximate the 0-tilted CDF Fg(y) by applying the Lugannani 
and Rice approximation with the CGF Kq(s) = Kq(9 + s) — Kq(9). 

If the convergence strip of Kq{9) is 9 G (—a, (3) with finite @, then )C(_ 00t y\(9) 
is defined on the larger set (—a, 00), but it is not clear how to extend this 
approximation to 8 G 00). A simple extension is discussed in Butler and 
Wood [(2002b), Section 5.2], though it turns out that this extension is un- 
satisfactory. 
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The Lugannani and Rice approximation to Fg(y) is given by 

(19) F e (y) = $(w e ) + ^(wg^wj 1 - Ug 1 ), 

where $ and (f> are, respectively, the standard normal CDF and density; 

(20) w e = sgn(i y - 6)[2{{t y - 9)y - K (t y ) + K (9)}} 1 / 2 

and uq = (t y — 0){Kq (ty)} 1 ' 2 , where sgn(x) = —1,0, 1 depending on whether 
x is negative, zero or positive; and t = t y is the unique solution to the sad- 
dlepoint equation K' Q (t) = y. 

The approximation Fg (y) is quite simple to use since it is an explicit func- 
tion of 9 once t y , the saddlepoint for = 0, has been determined; thus, the 
function /C(_ OO)J/ )(0) is available in explicit form once the single saddlepoint 
solution t y has been obtained. To see this, note that the saddlepoint for the 
tilted distribution sg solves 

K'g(sg)=K' (sg+9)=y = K' (t y ). 

By uniqueness of the saddlepoint sg + 9 = t y , so that only the computation 
of t y is required in order to determine {so: 9 £ (— a,/3)}. Thus, the CGF 
approximation 

(21) )C { _ 00>y) (e)=Ko(9) + log{Fg(y)/Fo(y)}, 0e(-a,P), 
is explicit in 9. 

The first two derivatives of the approximation are given by 

^ oo , y) {9) = K(9) + {%{y)r 1 d%{y)/d0 

and 

Kg-eo.dP) = K(9) + {F e (y)r l d 2 F e {y)/d9 2 - [{^(y)}" 1 dF e (y)/d9} 2 , 
where 

dF e (y)/d9 = 4>(we)[{y - K(9)}(wf - u^ 1 ) - (t y - 9)~ 2 {K^t y )Y 1 l\ 

and the second partial derivative d 2 Fg(y) / d9 2 is most easily obtained by 
numerical differentiation. 

In the case of JCr yj00 \(9), for 9 £ (— a, (3), we have the approximations 

(22) £(,,00) = K (9) + log[{l - Fg(y)}/{1 - F (y)}}, 
KfooojW = K 'M - {1 - F e (y)r l dF e (y)/d9 

and £"y,o O )( )= K o( )- T (O,y), where 

T(6, y) = {l- My)}' 1 d 2 F e {y)/d9 2 + [{1 - Fg(y)y l dF e (y)/d9] 2 , 
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with the partial derivatives of Fg{y) the same as before. 

For general a < b, we may approximate JCr a> t,\(9) = \ogM.^ a ^{9) by 

£(«,&) (0) = K Q {6) + log[{F e (b) - F e (a)}/{F (b) - F (a)}], 

(23) 

9€(-a,P). 

This is an explicit expression in 6 once two saddlepoints have been deter- 
mined by solving K' (t a ) = a and K^t^) = b. 

The Lugannani and Rice approximation is exact when applied to the CDF 
of an arbitrary normal distribution. Therefore, (23) is exact in this case. 

3.2. The exponential convolution approximations. These approximations 
are obtained by applying saddlepoint approximations to the integrals defin- 
ing E±(9, y) and E 2 {9, y). Denote these saddlepoint approximations by E\{9, y) 
and £2(0, y). Then in this approach the CGFs K-(-oo,y){@) an d fc( y ,oo)(@) are 
approximated by 

(24) ^(-oo,„)W=log{Hi(fl,y)/S 1 (0,y)}, 9 > -a, 
and 

(25) }C {y>oo) (9)=log{E 2 (9,y)/E 2 (0,y)}, 9<p. 

To reduce the number of formulae in this section, we shall use the subscripts 
1 and 2 to indicate the intervals (— 00, y) and (y,oo), respectively. 
The saddlepoint approximations to Ej(9,y) (j = 1,2) are given by 

(26) Ej(9,y) = [2it{1 + (9- s jy 9 fK{s j: e)}}~ 1/2 exp{K Q ( Sj , e ) - {s jfi - 0)y}, 
where Sj t $ is the unique solution to 

(27) K^s) + {9-s}- l = y 

in (— at, ft) which satisfies <9 (j = 1) and s 2 ^ > 9 (j = 2). 
After some simplifications, we obtain 

(28) iCj(9) = 9y + D 3 {9) - D 3 {Q) + K (s j>e ) - K (s jfi ) - ( 8jfi ~ Sj,o)y, 
where, using implicit differentiation, we have 

Dj(9) = \ \og{ds jfi /d9) = -±log{l + (9- Sji g) 2 K^ 8j>e )}. 

Note that the approximations are calibrated so that /Cj(0) = /Cj(0) = 0, j = 
1,2. 

The first derivative of fCj{9) (j = 1,2) is given by 

= y + D'j(0) - {y - K' Q {s jfi )} ds^/89, 
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where 

D'jie) = \{d 2 s jfi /d9 2 )/(d Sj . e /de), ds^/dO = {i + (e-s jfi fKZ(s jfi )y l 
and 

d 2 s jtd (9 - s jt e) 2 [KH f ( Sjte ) + 2(9 - s^jK^s^)} 2 } 
de 2 {l + {e-s h e) 2 K'i(s jfi )Y 

The second partial derivative d 2 KLj(9) / d9 2 can be determined using numer- 
ical differentiation. 

In some examples we considered the second-order saddlepoint approxi- 
mation to 3j given by Ej(9,y) = Ej(9,y)R(9,y), where R(9,y) is the usual 
second-order term given in this case by 

1 KS"(s j>e ) + 6(9 - Sj , e )- A 5 + 2(9 - s^)" 3 } 2 



m,v) l + s{K ,, {Sje) + {e _ s . e) -2 } 2 24 {K%{8 jfi ) + (9-8 ifi )-*y ' 

The resulting approximations to K,i a ^\(9) based on (17) and (18) are 

(29) ^ li(O)6) (e)=log[{Hi(fl,6)-3 1 (e j a)}/{S 1 (0,6)-S 1 (0 J a)}] 
for 9 G (—a, oo), and for 9 G (— oo, /?), 

(30) iC W) (9) = \og[{3 2 (9, a) - E 2 (9, b)}/{E 2 (0, a) - 3 2 (0, &)}]. 

3.3. Summary of numerical results. We now discuss several examples 
which have been chosen to illustrate some general points. A more extensive 
set of examples is given in Butler and Wood (2002b). As before, the trunca- 
tion occurs at — oo<a<6<oo and the convergence strip of the underlying 
CGF K is (— a, (3), where < a,/5 < oo. 

1. In cases where truncation leads to an extension of the convergence strip 
of the MGF (i.e., if either a > — oo and a < oo, or b < oo and (5 < oo, or both) 
the most obvious way to extend the LR-based approximation of Section 3.1 is 
described in Butler and Wood [(2002b), Section 5.2]. However, this extended 
approximation is poor, as can be seen in Figure 1. The discussion in Butler 
and Wood [(2002b), Section 5.2] indicates that this is a general problem and 
not specific to this example. 

2. Theoretical results (see Theorem 4.2 and Section 5.3) indicate that 
when an exponential convolution approximation is used it is appropriate to 
use (24) or (29) in the right tail and (25) or (30) in the left tail. These 
findings are strongly supported by our numerical examples; see Figure 2 for 
a typical case. 

3. In our numerical examples, we have found that the first-order saddle- 
point approximation to Hj works better in the case of one-sided trunca- 
tion (i.e., if either a = — oo or b = oo), while the second-order approximation 
works better in the case of two-sided truncation (i.e., if a > — oo and b < oo). 
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4. In those cases where the convergence strip does not need to be ex- 
tended, the LR-based approximation has generally proved more accurate 
than the appropriate exponential convolution approximation, though the 
latter performs respectably. Figures 3 and 4 present a typical example of 
this finding. 




Fig. 2. Two-sided truncation of Normal(0,l) . Plot of /Ci,(_i,2) (0) (dotted), /C2,(-i,2) {0) 
(dashed), and fci-i^) {&) (solid ) for 8 G (—5,6). 
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5. Finally, we return to Example 1.1. The question of interest here is how 
accurately we can approximate rectangular Dirichlet probabilities using the 
truncated MGF approximations described above, thereby avoiding the exact 
computation of the truncated CGF, which is difficult. Table 1 presents re- 
sults for particular examples. The exponential convolution approximations 
show consistent accuracy when the saddlepoint is positive; and the Lugan- 
nani and Rice approximations are consistently accurate when the saddle- 
points are negative. Inaccuracy only arises when either approximation is 
used in the inappropriate tail. 

3.4. Rule of thumb. The results of Section 3.3 suggest the following rule 
of thumb for choosing the approximations, which has worked very well in 
all the examples we have looked at. In the rule, left and right tail refer to 
9 < and 6 > 0, respectively. 

Approximation for right truncation (—oo,y). Use the Lugannani and 
Rice approximation (21) in both tails with one exception. If /3 < oo, so the 
convergence strip is extended in the right tail, then use (24) in the right tail. 

Approximation for left truncation (y,oo). Use the Lugannani and Rice 
approximation (22) in both tails if there is no extension in the left tail. With 
extension due to a > — oo, use (25) in the left tail instead. 




Fig. 3. Two-sided truncation of the Gumbel distribution. Plot of /C(_i_2) (0) (solid) and 
the "rule of thumb" approximation (centers of circles) that uses the Lugannani and Rice 
approximation (23) for 6 < and the exponentially convoluted approximation (29) for 
9>0. 
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Fig. 4. Plot of lOOx absolute error for the "rule of thumb" approximation (solid) in 
Figure 3. If the exponentially convoluted approximation (30) replaces (23) for 6 < 0, then 
the error is shown as the dashed line. 



Approximation for two-sided truncation (a,b). Use the Lugannani and 
Rice approximation (23) in those tails in which there are no extensions. 
Where extensions occur in the left and/or right tails, use (30) and/or (29), 
respectively. 

Since all the approximations to the truncated CGF are calibrated to be 
zero at 9 = 0, it follows that the approximation obtained by following the rule 
of thumb will be continuous but, in general, not continuously differentiable 
at 6 = 0. However, we have not found the lack of smoothness at 6 = to be 
an issue in practice. 

4. Theoretical accuracy in the tails. We now investigate the behavior 
of the approximations to M( a ,b){Q) and K-^J^iO), t = 1,2, as \9\ — > oo. We 
make the following assumptions throughout this section: 

(Al) The exponential family {Fg : £ (—a, (3)} is steep, that is, \Kq(6)\ — > oo 

as 6 I —a and as 6 j /3. 
(A2) The density Jq has one-sided limits at the truncation points a and b, 

that is, the limits lining /o(a + e) = fo(a+) and lim e jo fo(b — e) = fo(b—) 

both exist. 

Note that under (Al) and (A2) and regardless of the value of a > —oo we 
have, as 8 — > oo, 

M, h , (e) ^(^ l e db Mb-)/[F (b)-F (a)}, if b < oo, 
•M(a, & )W \M (e)/[l-F (a)}, if6 = oo, 
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Table 1 

Dimchlet probability computations; see Example 1.1. For the various values ofn,y and 
(a, b), the "Exact" probability as listed was computed using symbolic computation in 
Maple V. The mean of Z is listed in the cell "Mean" and its value relative to value 1 

determines whether the listed saddlepoints for methods Kz and Kz are negative or 
positive. Category U SA, Kz using Hi" approximates the CGF of each Yi by using the 
appropriate (one-sided or two-sided ) second-order exponential convolution approximation 
based on Hi given in (24) and (29), respectively. Upon determination of Kz, the results 

of its first-order saddlepoint density inversions are listed. The final column U SA, Kz 
using L&R" shows comparable computations using the LR-based approximation Kz given 

by (21) or (23), as appropriate 



n 


7 


a 


Exact 


SA, Kz using Hi 


SA, Kz using L&R 




Mean 


b 




Saddlept. 


Saddlept. 


3 


(10,8,8) 


(0) 3 


0.9527 


0.8877 


0.9756 


3 


1.454 
(10,8,8) 


11 10 11 

19 ' 19 ' 19 

(0) 3 


0.02400 


-23.6 
0.02479 


-23.5 
0.00141 




0.9435 


0.45, (0.3) 2 




36.3 


0.9667 


3 


(I) 3 


(0.2) 3 


0.04000 


0.03869 


0.0117 




0.9080 


(0.4) 3 




11.50 


10.26 


3 


(I) 3 


(0) 3 


0.04000 


0.04059 


0.02831 




0.5707 


(0.4) 3 
(O) 5 




15.59 


0.9203 


5 


(I) 5 


0.3680 


0.3540 


0.2535 




0.9268 


(0.4) 5 

(o) 5 




1.00 


0.1335 


5 


(1,2,. ..,5) 


0.5526 


0.5469 


0.5439 




1.389 


(0.4) 5 
(O) 5 




-10.14 


-9.93 


5 


(1,2,. ..,5) 


0.0 6 2288 


0.0 6 2336 


0.0 7 5733 




0.7221 


0.5,0.4,(0.1)3 




12.65 


0.9140 


5 


(1,2,. ..,5) 


(0.1) 5 
(0.3) 5 


0.03220 


0.03217 


0.03183 




1.125 




-10.02 


-9.15 


10 


(I) 10 


(1/15) 5 


0.0 4 5080 


0.0 4 3175 


0.0 4 4969 




1.776 


(0.3) 5 




-29.56 


-28.56 



and regardless of the value of b < oo we have, as 9 — > — oo, 

M, < e) ^{0- l e ea Ha+)/[F (b)-F,{a% ifo>-oo > 
M(a,b){V) \ Mo{e)/Fo ( b ), ifo = -oo. 

4.1. Accuracy of the Lugannani and Rice approximation. We first con- 
sider the accuracy in the tails of the Lugannani and Rice (LR) approximation 
■M.{a,b) an d its logarithmic derivatives ^w m an d K-( a b)- Theorem 4.1 below 
is proved in the Appendix. 

Remark 4.1. Comparison of the results in Theorem 4.1 with the limit- 
ing results for M( a ,b)(@) above shows that the relative error stays bounded 
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in all cases. With fc'f a fy(9) and IC" ab ^(9), the errors actually go to zero as 



oo. 



Theorem 4.1. Consider the LR approximation M.( a> b){6) specified in 
Section 3.1. Assume that (Al) and (A2) both hold. Suppose also that (i) a = 
oo in all statements concerning the left tail and P = oo in all results con- 
cerning the right tail; and (ii) as \9\ — ► oo, uq/wq — > 0, where wg and ug are 
given in (20) and below (20), respectively, with y = a orb as appropriate. 

(i) As 9 ->■ oo, 

■M(a,6)(0) 

and as # — > — oo, 
■M(<M#)~ 



I 0- 1 e e6 /o(6-)/[i i b(6) - ^o(a)], i/ 6 < oo, 
\M (#)/[1-F (a)], i/6 = oo, 

^ 1 e ea / (a+)/[FoW - fb(a)], i/a > -oo, 

M o (0)/F o (&), ifa = -oo, 



where /o is £/ie saddlepoint density approximation to /o and i<o is i/ie Lu- 
gannani and Rice approximation to the CDF Fq . 
(ii) ^4s 9 — > oo, 

^(a )6 )W \^(0)(1 +O (1)), i /6 = oO, 

and as 9 ^ — oo, 



(iii) ^4s 6* — > oo, 



a - + o(9~ 1 ), if a >-oo, 
me){l + o(l)), </a = -oo. 



r" J 6 * 2 ' ifb<oo, 

and as # — > — oo, 

r" J 61-2 ! if a >-oo, 

K-{a,b)\y) | Kq(9), ifa = -oo. 

4.2. Accuracy of the exponential convolution approximation. For j = 1, 2, 
let Sj(0,i/) and £j. r) (0), r = 0,1,2, be as in Section 3.2 and define A4j(9) = 
Ej(9,y)/Ej(0,y). Also, for — oo<a<6<oo, define 

M 1>M (0) = A (0,6) - Hi(e,a)}/{Hi(0,6) - 3i(0,a)}, 

K-i.(a,b)(9) = log A / (i > ( ai fe)(0), with corresponding definitions for M-2,(a,b){^) 
and £ 2 ,(a,&)(0)- 
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Remark 4.2. Comparison of the results in Theorem 4.2 with the lim- 
iting results for M.t a fi\(Q) shows that the relative error stays bounded in all 

cases. With KL'AO) and ICj(9), the errors actually go to zero as 9 — > ±oo in 
the cases covered by the theorem. 

Theorem 4.2. Assume that (Al) and (A2) both hold. 

(i) As 9 -> oo, 

Mi(0) ~ e- x e^Uy-)/E x {Q, y), iC[(9) = y-9~ 1 + o{9~ 1 ) 
and /C[(9)~9- 2 . 

(ii) The limiting behavior of M2, fc'2 an d ^-2 * n the lower tail is the same 
as that of M\{9), K,'i(6) and fC'{{9) in the upper tail, as given in part (i). 

(hi) If —00 < a < b < 00, then as 9 — > 00, 

M Ua , b) {9) ~ 0-V 6 /o(6-)/{2i(O,&) - Hi(0,o)}, 

^,mW = 6-« _1 + o(0 ondr? i(ai6) (0)~e- 2 . 

(iv) If 6 —> —00, then M.2,(a,b)(,Q) an d the derivatives o//C 2) ( 0) 6) o&ey re- 
sti/te corresponding to those in part (hi), &ui w«#i a replacing b. 

Proof. In part (i), the key point to note is that s± t $ — > t y as 9 — > 00, 
and then the proof follows easily. The proof is essentially the same in the 
other cases. □ 



4.3. Behavior in the other tail. In Theorem 4.2 we described the limiting 
behavior of M\{9) and its logarithmic derivatives as 9 — > 00, and the behav- 
ior of M.2{9) and its logarithmic derivatives as 9 — > — 00. In this section we 
indicate, without proof, what happens to M.\{9) and its derivatives when 
9 — > —00. The results for M2(6) are similar and are therefore omitted. 

If lim si _ Q i^(s)/[i^( S )] 2 ^0, then 

(31) M 1 (9)^M (9)(e/V2^)/^i(0,y) as 9 ^ -00. 
Under the stronger conditions 

]hnKS(s)/K' (8)^0 and lim (s)/[K' (s)} 3 -> 0, 

si— a si— a 

we have 

(32) K! x (e)~K' Q {p); 

and still stronger conditions are needed to ensure that 

(33) K&e)~KS{0). 
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A sufficient condition for (31)-(33) to hold is the following: 
(34) for each j > 2 lim Kq ^ (s) stays bounded. 

si— a 

Note that condition (34) holds for the normal distribution, gamma distribu- 
tion (in the left tail) and any other distribution which has bounded support 
on the left. However, in the case of —X, where X has a gamma or inverse 
Gaussian distribution, or if X has a logistic distribution, then KL'i{9) and 
K.'{(6) do not stay bounded as 9 { —a, and (31)-(33) fail to hold. 

APPENDIX 



Proof of Theorem 2.1. Using the convolution formula for densities 
[see, e.g., Theorem 6.1.2 in Chung (1974), for a precise statement], we have, 
for 9 > 0, 

/ e te / (x) dx = — / ee-^y- u h(n < y)f (u) du = —fx+E(y), 

J-oo V J -oo V 

where E is an exponential random variable with rate parameter 9, which is 
independent of X. Define 

H Ci6 (t) = [M (c + it)/{l - (c + it)/6}]/[M (c)/{l-c/e}], 

so that Ho : ff(t) is the characteristic function (CF) of fx+E{y), and H C! g(t) 
is the CF of the c-tilted density f x +E(y)e cy /{M (c)/(1 - c/6)}. Note that 
if (8) holds for some c E (— a,/3), then (8) holds for all c S (— a, (3). For a 
proof of this result, which involves two applications of the Hausdorff- Young 
inequality, see Lemma 2.3.4 of Jensen (1995). Using Holder's inequality, 

roc 1 _ r /a / poo \ l/(l+v(c)) 

x [J-oo \i-(c + it)/e\v+"W/"M dt ) < 00 

for each c€ (—a,mm(/3,9)). Therefore we may apply the Fourier inversion 
theorem [see, e.g., Chung (1974), page 155, for a precise statement] to H c fi(t) 
to obtain 

(35) — / H cfi {t)e~ lt y dt = f x+E {y)e c y/{M {c)/(l - c/6)}. 

After some rearrangement, we find that (35) gives (9) for all 9 > and 
cG (— a,mm(j3, 9)). This shows also that Ri(9,y) does not depend on the 
choice of c\ in (6). An analytic continuation argument extends (9) to 9 € 
(— a,0], thus (9) is established for all 9 > —a. 
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Identical reasoning gives (10) and (15), and (11) follows immediately after 
adding (9) and (10). The statements (16), (17) and (18) follow directly from 
the definitions. □ 

Proof of Theorem 4.1. The LR approximation to Mr a b)(9) is given 

by 

M (a , b) (9) = M (9)[{F e (b) - F e (a)}/{F (b) - F (a)}]. 

(i) Case 9 — > oo, b = oo. Note that Fg(b) = 1 for all 9 and Fq(cl) — > as 
9 — > oo, so M^ h ){9) ~ Mq(9)/[1 - F (a)] as required. 

Case 9 — > oo, b < oo. Here Fg(a)/Fg(b) — > 0, so 

M (a>b) (9) ~ Mo(e)F fl (6)/[F (6) - F (a)]. 

By assumption uq/wq — > as — > oo. Moreover, elementary calculations 
show that as — > — oo, <3?(u/6i) ~ — ^(u^)]?^ 1 + ], and it then follows 
easily that 

F fl (6) ~ -<t)(w e )/u e ~ 9- 1 e w f (b-)/M (9) as ^ oo, 

where / (6) = (2vr)- 1 / 2 |Ko'(t fe )|- 1 / 2 exp{i^ (4) - is the saddlepoint ap- 
proximation to fo(b). The proofs for 9 — > — oo with a > — oo are similar. 

(ii) We have 

= K(f>) + {Fe(b) - F e {a)y 1 [dF e (b)/d9 - 8F e (a)/89]. 

Case 9 -> oo, 6 = oo. Since -F e (a) -> 1, 8F e {b)/89 = 0, 8F e (a)/89 
and Kq(9) — > oo, the result follows. 
Case — ► oo, 6 < oo. Here 

F fl (a)/F fl (6) -> and {8F {a) / 89) / {8F e {b) / 89) 0. 

Therefore, 

£ U W = (0) + {^(b)}" 1 9^(6)/Se + o^- 1 ) 
= K' (9) + b-K' Q {9) + 9- 1 + o{9- 1 ) 
= b + 9- l + o(9~ l ), 

as required. The cases 9 — > — oo with a = — oo and a > — oo are proved in 
similar fashion. 

(iii) The results here follow from similar but more extensive calculations. 

□ 
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